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Abstract 

In Q , Berstel proved that the Arshon sequence cannot be obtained 
by iteration of a morphism. An alternative proof of this fact is given 
here. The cr-sequence was constructed by Evdokimov in order to con- 
struct chains of maximal length in the n-dimensional unit cube. It 
turns out that the cr-sequence has a close connection to the Dragon 
curve H . We prove that the cr-sequence can not be defined by iteration 
of a morphism. 



1 Introduction and Background 

In 1937, Arshon gave a construction of a sequence of symbols w over the 
alphabet {1,2,3}, constructed as follows: Let Wi = 1. For k > 1, Wk+i is 
obtained by replacing the letters of in odd positions thus: 

1 -> 123, 2 231, 3 -> 312 

and in even positions thus: 

1 -> 321, 2 -> 132, 3 -> 213. 

Then 

w 2 = 123, w 3 = 123132312, 

and each iWj is the initial subword of ifj+i, so the infinite symbolic sequence 
w = lim w n is well denned. It is called the Arshon sequence. 
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This method of constructing w is called the Arshon Method (AM), and 
■0 will denote the indicated map of the letters 1, 2, 3 according to position 
as described above. 

We will denote the natural decomposition of w in 3-blocks by lower braces: 

w =123, 132 v 312 / . . . . 

The paper by Arshon was published in connection with the problem 
of constructing a nonrepetitive sequence on a 3-letter alphabet, that is, a 
sequence that does not contain any subwords of the type XX = X 2 , where 
X is any non-empty word of a 3-letter alphabet. The sequence w has that 
property. The question of the existence of such a sequence, as well as the 
questions of the existence of sequences avoiding repetitions in other meanings, 
were studied in algebra [|T], |^, [TTJ, discrete analysis P, [JJ and in dynamical 



systems [14 



Any natural number n can be presented unambiguously as n = 2*(4s + cr), 
where a < 4, and t is the greatest natural number such that 2* divides n. If 
n runs through the natural numbers then a runs through the sequence that 
we will call the a-sequence. We let w a denote that sequence. Obviously, w a 
consists of Is and 3s. The initial letters of w a are 11311331113313 .... 

In H |I7| , Evdokimov constructed chains of maximal length in the n- 
dimensional unit cube using the cr-sequence. Originally, the cr-sequence was 
defined by the following inductive scheme: 

d = 1, D l = 3 
Ck+i = Cfel-Dfc, D k+ i = Ck3D k 
k = l,2,... 

and w a = lim C k . 

k— »oo 

Our definition above of the cr-sequence is equivalent to this one. 

The motivation for studying the cr-sequence is that this sequence has a 
close connection to the well-known Dragon curve, discovered by physicist 
John E. Heighway and defined as follows: we fold a sheet of paper in half, 
then fold in half again, and again, etc. and then unfold in such way that each 
crease created by the folding process is opened out into a 90-degree angle. 
The "curve" refers to the shape of the partially unfolded paper as seen edge 
on. If one travels along the curve, some of the creases will represent turns 
to the left and others turns to the right. Now if 1 indicates a turn to the 
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right, and 3 to the left, and we start travelling along the curve indicating the 
turns, we get the a-sequence ||. 



Let E be an alphabet and E* be the set of all words of E. A map (p : E* — » 
£* is called a morphism, if we have (p(uv) = ip(u)p(v) for any u,v G E*. It 
easy to see that a morphism ip can be defined by defining (p(i) for each z e £. 
The set of all rules i — > cp(i) is called a substitution system. We create words 
by starting with a letter from the alphabet £ and iterating the substitution 
system. Such a substitution system is called a .D&L (Deterministic, with 
no context Lindenmayer) system [0. DOL systems are a classical object of 
Formal Language Theory They are interesting from mathematical point of 



view, but also have applications in theoretical biology [III 



Suppose a word <p(a) begins with a for some a e E, and that the length of 
(p k (a) increases without bound. The symbolic sequence lim p k (a) is called 

fe— >oo 

a /ixed point of the morphism ip. 

We now study classes of sequences that are defined by iterative schemes. 



There are many techniques to study sequences generated by morphisms |T3 



It is reasonable to try to determine if a sequence under consideration can be 
obtained by iteration of a morphism. 

Since the construction of the Arshon sequence w is similar to the iterated 
morphism scheme, and because w is constructed by two morphisms fi and 
f 2 , applied depending on whether the letter position is even or odd, we might 
expect that there exists a morphism / which generates w. But this turns 
out not to be true. In ||, Berstel proved that the Arshon sequence cannot 
be obtained by iteration of a morphism. However, Berstel's proof appeals to 
advanced machinery, namely a sort of Wilf-Fine theorem due to Cobham Q . 
In Theorem 1, we show that such machinery is not needed here, and offer a 
simple and direct proof. 

Naturally a question arises as to the possibility of constructing w a using 
the iteration of a morphism, since such a construction could help us in study- 
ing w a , and thus in studying the Dragon curve. This also turns out not to 
be true, as shown in Theorem 10. 



2 The Arshon Sequence 

Theorem 1. There does not exist a morphism whose fixed point is the Ar- 
shon sequence. 
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Note. A corollary of this theorem is the non-existence of a morphism 
whose iteration gives the Arshon sequence. In fact, if such a morphism 
exists, it must have the property that 1 is mapped to IX by the action of the 
morphism, where X is some word, and from this it follows that the Arshon 
sequence is a fixed point of this morphism. 

Proof. It is enough to prove the non-existence of a morphism / with the 
property w = f(w), since from the definition of a fixed point we have that 
if to is a fixed point of the morphism / then w = f(w). Suppose there 
exists a morphism / such that /(l) = X, /(2) = Y, /(3) = Z and w = 
f(w). Moreover, in order to avoid dealing with the identity morphism, we 
assume that |/(123)| > 3. We do this since w is a fixed point of the identity 
morphism, but iterations of this morphism obviously do not give us the 
Arshon sequence. From all such morphisms / we choose a morphism with 
minimal length of X. 

Clearly not all of \X\, \Y\ and \Z\ can be 0. Suppose for example then that 
| A" | = 0, \YZ\ ^ 0. Then w contains the subword /(123132) = XYZXZY = 
YZZY. If \Z\ 7^ then w contains the repetition ZZ. Otherwise w contains 
repetition YY. This is impossible, hence \X\ ^ 0. One can check in the same 
way that \Y\ ^ and \Z\ ^ 0. Now \X\ + \Y\ + \Z\ ^ 3, since otherwise 
|/^(1)| = 1 for £ = 1, 2..., and w is not a fixed point of the morphism /. Now 



hence X consists of \X\ of the first letters of w, Y is \Y\ of the following 
letters, and Z is \Z\ of the letters following that. 

We will use upper braces to show the decomposition of w into /-blocks 
(that is, to show the disposition of the words X, Y and Z in w). We have 



where all a, are letters of the alphabet {1,2,3}. 
Lemma 2. We have \X\ + \Y\ + \Z\ = (mod 3). 

Proof. From the structure of w, the frequencies of 1, 2, 3 in to coincide, hence 
the frequencies of these letters in f(w) = w coincide as well. But this is only 
possible when \X\ + |y| + \Z\ = (mod 3). Indeed, otherwise there are two 
letters, whose frequencies in /(123) = XYZ do not coincide, which implies 



f( w ) = w = XYZXZY ZXY ... , 



Y Z X 
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that the frequencies of these letters in f(w) = w do not coincide as well, 
since w can be written as w = W1W2W3 . . . , where Wi is a permutation of 
the letters X, Y and Z. □ 

Lemma 3. The situation \X\ = \Y\ = \Z\ = (mod 3) is impossible. 

Proof. Suppose |X| = \Y\ = \Z\ = (mod 3). Then X, Y and Z consist of 
a whole number of 3-blocks. The properties of ift mean that the morphism g 
given by g(l) = f 1 ^), g{2) = ^~\Y), g(3) = i>~\Z) satisfies w = g(w). 
By the minimality of /, |c/(123)| = 3, so that 5(123) = 123. This implies 
that /(l) = 123, /(2) = 132, /(3) = 312. However, this impossible, since 
then /(1231) = 123132312123 is not a prefix w. □ 

We define the Nth f -block X to be the block X obtained by considering 
the decomposition of w into /-blocks and then taking the iVth block that 
corresponds to the word X (possibly skipping some blocks that corresponds 
to the words Y and Z without counting them). Thus, for example, when 
we say "the 4th /-block X" , we mean not the following marked block w = 
XYZXZY . . . , but the following marked block w = XYZXZYZXYZYX . . . 

Lemma 4. With the assumption of the existence of the morphism f , \X\ < 5. 

Proof. Suppose \X\ > 6, that is, X = 123132.... If \X\ = 2 (mod 3) 
(\X\ = 1 (mod 3)), then \X\ > 7 and using Lemma 2 we consider the 4th 
/-block X = 12,313,. . . (X = 1^23^323,...)- This contradicts the AM. 
Hence \X\ = (mod 3). 

It follows from Lemma 3 that the situation |Vj = (mod 3) is impossible. 
If \Y\ = 1 (mod 3) = 2 (mod 3)), then we consider the 10th (3rd) /- 
block X = 12^313,2 . . . and it brings us to a contradiction with the AM. 

Hence if \X\ > 6 then the morphism / can not exist. □ 

Lemma 5. With the assumption of the existence of the morphism f , \X\ 7^ 1. 

Proof. If \X\ = 1, then X = 1 and the length of the words f k (l) for k = 
1,2,... does not increase, whence w is not a fixed point of the morphism /. 
This is a contradiction. □ 

Lemma 6. With the assumption of the existence of the morphism f , \X\ 7^ 2. 
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Proof. Suppose \X\ = 2, that is X = 12. 

We have \X\ = 2 (mod 3), hence, using Lemma 2, we have \Y\ + \Z\ = 1 
(mod 3). 

We consider the 2nd /-block X and the /-block Z next after it. It can be 
seen that Z begins with 3. We consider the 4th /-block X and Y preceding it 
and find that Y ends with 3. But then, considering YZ, which is a subword 
of w, we see, that 33 is a subword of w, which is impossible. That is for 
\X\ — 2 the morphism / cannot exist. □ 

The 3-blocks 123, 231, 312 are said to be odd 3-blocks. All other 3-blocks 
are said to be even. 

Lemma 7. With the assumption of the existence of the morphism f , \X\ ^ 3 

Proof. Suppose \X\ = 3, that is X = 123. 

We have \X\ = (mod 3), hence, using Lemma 2 we have \Y\ + \Z\ = 
(mod 3). Considering the AM, the 2nd /-block X must be an odd 3-block, 
hence \Y\ + \Z\ = 1 (mod 2). 

Let \Z\ > 2. Then the 2nd /-block Z begins with an even 3-block, and 
the 3rd Z begins with an odd 3-block. This is impossible. (Note that 2 
letters define the evenness of the 3-block unambiguously.) Thus \Z\ = 1. 

Let \Y\ > 2. In XYZ (or in an arbitrary permutation of these letters) 
there is an even number of 3-blocks, so the 9th /-block Y begins with an odd 
3-block, but the 1st Y begins with an even 3-block. Hence \Y\ = 1. 

This is a contradiction with \Y\ + \Z\ = (mod 3) (and also a contra- 
diction with \Y\ + \Z\ = 1 (mod 2)). That is for \X\ = 3 the morphism / 
cannot exist. □ 

Lemma 8. With the assumption of the existence of the morphism f , \X\ ^ 4 

Proof. Suppose \X\ = 4, that is X = 1231. 

We have \X\ = 1 (mod 3), hence, using Lemma 2, we have \Y\ + \Z\ = 2 
(mod 3). 

We have \Y\ > 2, since otherwise Y = 3 and hence XYX which is a 
subword of w, contains 3131, which is impossible. Hence Y = 32.... We 
consider ZX and ZY and see that Z ends with 2, since otherwise w contains 
Zl and Z3, and thus the square 11 or 33. Now \Z\ > 2, since otherwise Z = 2 
and XZX which is a subword of w, contains 1212, which is impossible. Hence 
Z — ... 32, or Z — ... 12. The former is impossible since 3232 is contained 
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in ZY, and hence in w. The latter is impossible too, since considering the 
9th /-block Z and the /-block X following it, we obtain ZX = . . . v L2^ v 231 / , 

which contradicts the AM. That is for \X\ = 4 the morphism / cannot 
exist. □ 



Lemma 9. With the assumption of the existence of the morphism f , \X\ ^ 5 

Proof. Suppose \X\ = 5, that is X = 12313. 

We have \X\ = 2 (mod 3), hence, using Lemma 2, we have \Y\ + \Z\ = 1 
(mod 3). Then the 4th /-block is X = 12^313^, which is a contradiction with 

the AM. That is if \X\ = 5 then the morphism / cannot exist. □ 

From Lemmas 4 - 9 we have a contradiction with the assumption of the 
existence of the morphism /. This proves Theorem 1. □ 

Remark. In 0, Arshon gave the construction of a nonrepetitive sequence 
w n for an n-letter alphabet, where n is any natural number greater than or 
equal to 3. It is easy to see that, for even n, there exists a morphism /„ that 
defines w n . Namely, for 1 < i < n, one has: 



fn(i) 



z(i + 1) . . . nl2 ... z — 1, if i is odd, 

(i — 1) (i — 2) . . . ln(n — 1) . . . i, if i is even. 



Theorem 1 shows that for n = 3 such a morphism does not exist. However, 
whether there exists a morphism defining w n for arbitrary odd n is still an 
open question. 



3 The cr-sequence 

Theorem 10. There does not exist a morphism whose iteration defines the 
sequence w a . 

Proof. Suppose there exists a morphism /, such that /(l) = X, /(3) = Y 
and w a = lim / fc (l). Obviously, X consists of the first \X\ letters of w, 

k^oo 

where |X| is the length of X. 

Lemma 11. The subsequence ofw a consisting of the letters in odd positions 
is the alternating sequence of Is and 3s: 1313131 .... 
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Proof. The odd positions of w a correspond to the odd numbers n = 2° (4s + 
a) = 4s + a, so clearly a alternates between 1 and 3. □ 



Lemma 12. If there exists a morphism f whose iteration gives w a then 
\X\ = (mod 4). 

Proof. It is easy to see that /(l) = 1X^\ where \X^\ > 1, since otherwise 
|/ fc (l)| = 1, for k — 1, 2, 3 . . . , so w a cannot be obtained by iterating /. 

Suppose \X^\ = 1, that is /(l) = 11. But then w a consists of Is only, 
which is impossible, hence /(l) = HX^, where |X^ 2 ^| > 1. 

Suppose \X^\ = 1, that is /(l) = 113. Since w„ has the subword 111, 
then w a has a subword /(111) = 113113113. If /(111) begins with a letter in 
an odd position, then the marked letters 113113113, read from left to right 
will make up consecutive letters of w a in odd positions. This contradicts 
Lemma 11. If /(111) begins with a letter in an even position, then marking 
letters in odd positions will lead to the same contradiction with Lemma 11, 
hence /(l) = 113X^\ where \X^\ > 1. 

Suppose \X^\ = 1, that is /(l) = 1131. Then / 2 (1) = 11311131^1131 
and the marked letter does not coincide with the letter of w a standing in the 
same place, hence /(l) = 1131XW, where \X^\ > 1. 

If \X\ is odd, then the marked letters in / 2 (1) = 1131X( 4 )ll31X( 4 ) . . . 
are two consecutive letters in odd places. This contradicts Lemma 11. Hence 
\X\ is even. 

We have / 2 (1) = XX ... = X1131X (4) . . . , whence the next-to-last letter 
of X is in an odd position and is equal to 3, since otherwise two consecutive 1 
in w a stand at odd places, which contradicts Lemma 11. The natural number 
which corresponds to the next-to-last letter of X is written as 2°(4s + 3), the 
next number is equal to \X\ and to 2°(4s + 3) + 1 = 4(s + l) = (mod 4). □ 

The following Lemma is straightforward to prove. 

Lemma 13. If m = 2' 1 (4s i + 1), n 2 = 2* 2 (4s 2 + 1), n 3 = 2* 3 (4s 3 + 3) and 
77-4 = 2* 4 (4s4 + 3) then nin 2 , n^n^ can be written as 2*(4s + 1), and nin 3 as 
2*(4s + 3). 

It follows from Lemma 12 that \X\ — At. 

Suppose X ends with 1 (the case when X ends with 3 is similar), that is 
at the (4t)th position in X we have 1. According to the multiplication by 2 
does not change a, so at the (2t)th position in X we have 1. 
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Consider / 2 (1) = XX.... The letters of the marked X occupy the 
positions of / 2 (1) from (At + l)th to (8t)th. Since X = X, then at the (6*)th 
place we have 1. But 6t = 3(2t), whence, by Lemma 13, at the (2i)th and 
the (6t)th places there must stand different letters. This is a contradiction 
and Theorem 10 is proved. □ 
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